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ABSTRACT 

Gamma-glutamylcysteine synthetase encoded by 
GSH1 is the rate-limiting enzyme in the biosynthesis 
of glutathione and trypanothione in Leishmania. 
Attempts to generate GSH1 null mutants by gene 
disruption failed in Leishmania infantum. Removal 
of even a single allele invariably led to the gener- 
ation of an extra copy of GSH1, maintaining two 
intact wild-type alleles. In the second and even 
third round of inactivation, the markers integrated 
at the homologous locus but always preserved two 
intact copies of GSH1. We probed into the mechan- 
ism of GSH1 duplication. GSH1 is subtelomeric 
on chromosome 18 and Southern blot analysis 
indicated that a 10-kb fragment flanked by 466-bp 
direct repeated sequences was duplicated in 
tandem on the same chromosomal allele each time 
GSH1 was targeted. Polymerase chain reaction 
analysis and sequencing confirmed the generation 
of novel junctions created at the level of the 466-bp 
repeats consequent to locus duplication. In loss 
of heterozygosity attempts, the same repeated 
sequences were utilized for generating extra- 
chromosomal circular amplicons. Our results are 
consistent with break-induced replication as a 
mechanism for the generation of this regional poly- 
ploidy to compensate for the inactivation of an es- 
sential gene. This chromosomal repeat expansion 
through repeated sequences could be implicated 
in locus duplication in Leishmania. 



INTRODUCTION 

The protozoan parasite Leishmania is the causative agent 
for leishmaniasis, and it belongs to the Kinetoplastida, 
one of the oldest eukaryotic lineages. Leishmania has a 
plastic genome and this was first illustrated by diverse 
karyotypes when different species of Leishmania were 
compared (1). Leishmania is considered as a diploid 
organism but several studies have shown that a portion 
of its genome can become aneuploid (2-A). Recently, 
fluorescence in situ hybridization studies have shown 
that aneuploidy appears to be common with variable 
chromosomal ploidy among individual cells in 
Leishmania (5). Gene amplification as a part of linear 
minichromosomes or extrachromosomal circles is also 
frequent in Leishmania and is a manifestation of genome 
plasticity (6-8). This kind of gene amplification can be 
found in unselected stocks or after drug selection (6,9). 
These gene rearrangements usually occur at the level of 
direct or inverted repeats (3,4,10,11). Another manifest- 
ation of genome plasticity is in the attempts to generate 
null mutant of reputed essential genes. Indeed, upon in- 
activation, gene rearrangement takes place maintaining at 
least one intact allele of the gene to be targeted. This has 
been reported abundantly (2,12,13), although the mechan- 
ism to generate this ploidy has not been studied into 
details, one study highlighted a change in chromosomal 
ploidy (2) and in another there was a translocation of a 
chromosomal segment to another chromosome (12). 

Recently, we have attempted to generate a null mutant 
of the gene GSH1 encoding gamma-glutamylcysteine 
synthetase (y-GCS), the rate-limiting enzyme in the bio- 
synthesis of glutathione in Leishmania. We reported that 
GSH1 is an essential gene in Leishmania (14). Indeed, all 



*To whom correspondence should be addressed. Tel: +1 418 654 2705; Fax: +1 418 654 2715; Email: marc.ouellette@crchul.ulaval.ca 
© The Author(s) 2011. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ 
by-nc/3.0). which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. 



7500 Nucleic Acids Research, 2011, Vol. 39, No. 17 



our attempts to generate chromosomal null mutants failed 
except if cells were first provided with a rescue GSH1 
plasmid encoding y-GCS. While generation of polyploidy 
is frequently observed when attempting disruption of 
essential genes in Leishmania (2,12), in the case of 
GSH1, the deletion of one single allele also led to gene 
rearrangements. In this current study, we provide 
evidence for a mechanism of intrachromosomal gene 
duplication where the GSH1 locus becomes polyploid 
preserving two intact copies of the essential gene upon 
sequential rounds of gene inactivation. 

MATERIALS AND METHODS 

Strains and culture conditions 

Promastigotes of Leishmania infantum (MHOM/MA/67/ 
ITMAP-263) and its transfectants were grown in SDM-79 
medium supplemented with 10% fetal bovine serum, 
5 jig/ml of hemin at pH 7.0 and 25°C. 

DNA constructs and transfection 

GSH1 inactivation cassettes with hygromycin phospho- 
transferase B, neomycin phosphotransferase, blasticidin 
deaminase for L. infantum were constructed using a 
polymerase chain reaction (PCR) fusion-based strategy 
and transfected as described previously (14). The 
LinJ_V3.1670 inactivation cassettes were also generated 
by a similar PCR fusion-based strategy with hygromycin 
and neomycin phosphotransferases as the selectable 
markers. Two micrograms of linear fragments for trans- 
fection were obtained by PCR amplification, gel purified 
and transfected into promastigotes by electroporation 
(15). Recombinants were preselected initially in the 
presence of 200ug/ml hygromycin, 20(.ig/ml G418 
(Geneticin, Gibco-BRL) or 50ug/ml blasticidin S 
(Invitrogen). After 24 h, the transfected cells were grown 
in the presence of higher drug concentrations and cells 
growing in highest drug selection (600 ug/ml hygromycin, 
80ug/ml G418; lOOug/ml blasticidin) were cloned. 

Pulsed-fleld gel electrophoresis 

Intact chromosomes were prepared from Leishmania 
promastigotes harvested from late log phase, washed 
and lysed in situ in 1% low melting agarose plugs. 
Briefly, cells were resuspended in HEPES-NaCl buffer at 
a density of 5 x 10 8 cells/ml and mixed with equal volume 
of low melting-point agarose. Cells were lysed in the 
presence of 0.5 M ethylenediaminetetraacetic acid 
(EDTA; pH 9.5), 1% sodium dodecyl sulfate (SDS) and 
350ug/ml proteinase K overnight at 50°C. For Hpal 
digestion of chromosomes embedded in agarose blocks, 
the blocks were washed twice in 25 vol of TE to remove 
proteinase K. The blocks were then equilibrated in restric- 
tion buffer at room temperature for 30min after which 
fresh buffer was added and 100 U of Hpal in a reaction 
volume of 200 ul were incubated at 37°C overnight. 
Leishmania chromosomes were separated by Pulsed-field 
gel electrophoresis (PFGE) using a Bio-Rad CHEF-DR 
III apparatus at 5V/cm, 120° separation angle as 



described previously (12). The range of chromosome sep- 
aration was between 100 and 900 kb and between 500 and 
1000 kb depending on the conditions used. Digested 
chromosomes were separated under conditions optimal 
for separating between 25 and lOOkb for 27 h. 

Southern blot analysis 

Genomic DNA of the clones was isolated using DNAzol 
(Invitrogen) and circular DNA was isolated by Promega 
Wizard miniprep kit following the manufacturer's 
instructions. Digested genomic DNA or circular DNA 
or separated chromosomes were subjected to Southern 
blot hybridization with [a- 32 P]dCTP-labeled DNA accord- 
ing to standard protocols (16). All probes were obtained 
by PCR from Leishmania genomic DNA. Densitometric 
analyses of Southern blots were performed using Image J 
and Agfa Arcus 2 scanner. 

RESULTS 

GSH1 polyploidy following gene disruption attempts 

The GSH1 gene is single copy (17) and essential in 
Leishmania (14). Removal of even a single wild-type 
(WT) allelic copy of GSH1 invariably led to the generation 
of an extra copy of GSH1, maintaining two intact WT 
alleles (14). As demonstrated previously (14) successive 
integration of the selectable markers hygromycin 
phosphotransferase (HYG), neomycin phosphotransferase 
(NEO) and blasticidin deaminase (BLA) in the GSH1 gene 
of L. infantum (Figure 1A) (14) led not only to the inte- 
gration of all the three markers in the GSH1 locus but also 
to preservation of intact GSH1 alleles (14). This was 
further confirmed with additional restriction digestions 
and Southern blots (Figure 1). A 3.0-kb band was 
observed in WT cells when hybridized to a 550-bp probe 
within the 5' flank of GSH1 (Figure IB). Upon sequential 
integration of the HYG. NEO or BLA resistance cassettes, 
digestion with Hindlll and probing with the 5' flank 
GSH1 probe of 550 bp yielded 5.2, 5.0 and 4.7 kb 
hybridizing bands, respectively, in addition to the WT 
band of 3.0 kb (Figure IB). Densitometric analyses were 
done with Image J software and values were represented 
as Net Integrated Optical density after background sub- 
traction which revealed a 2:1 ratio for WT alleles 
compared to the HYG, NEO or BLA alleles (Figure IB). 

In an attempt to further characterize the events leading 
to polyploidy at the GSH1 locus, we separated the 
chromosomes of the WT and recombinants by PFGE. 
Ethidium bromide staining of the gel did not reveal 
the presence of linear amplicons or gross karyotypic 
changes in the recombinants due to GSH1 rearrangement 
(Figure 2A) and hybridizing to a GSH1 probe indicated 
that the gene was present on the 720-kb chromosome for 
all recombinants with no evidence for circular amplifica- 
tion or gross translocation elsewhere on the genome 
(Figure 2A). Integration of the HYG, NEO and BLA 
markers at the GSH1 locus was further confirmed by 
hybridization with no evidence for gross gene rearrange- 
ments (Figure 2B-D). Chromosome 18 is disomic in the 
L. infantum strain studied. Indeed, we could easily obtain 
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Figure 1. GSH1 gene inactivation in L. infantum. (A) Schematic 
drawing of the GSH1 locus in L. infantum before and after integration 
of the inactivation cassettes (hygromyin phosphotransferase B, HYG; 
neomycin phosphotransferase, NEO; blasticidin deaminase, BLA) and 
the relevant Hindlll restriction sites (H). (B) Southern blot analysis 
with genomic DNA digested with Hindlll from the WT and recombin- 
ant clones and hybridization with a probe covering the 5' flank GSH1 
region. Molecular weights (M) are indicated on the left, the various 
alleles are pinpointed on the right and underneath is the raw data 
(Net integrated optical density values) of densitometric analysis of 
each allele as determined using the Image J software. 1, L. infantum 
WT, GSHljGSHl; 2-5, L. infantum with the following genotypes at the 
GSH1 locus: 2, GSH1/GSH1/HYG; 3, GSH1/GSH1/NEO; 4, GSHlj 
GSHl/NEOjHYG; 5, GSHl/GSHl/NEO/HYG/BLA. 



a GSH1 chromosomal null mutant in two rounds of trans- 
fection provided that a rescue plasmid was present (14). 
Additionally, we obtained a chromosomal null mutant of 
LinJ18_V3.1670, the gene downstream of GSH1 
(Figure 3), by two successive integration of NEO and 
HYG (Supplementary Figure SI). 

Gene rearrangements at the level of direct repeated 
sequences 

The generation of GSH1 polyploidy is neither due to 
translocation of a segment elsewhere on the genome nor 
by chromosomal aneuploidy. L. infantum promastigotes 
with the GSH1/GSH1/NEO/HYG genotype were selected 
for high concentration of G418 and hygromycin B in 
order to try to induce loss of heterozygosity (18) but this 
failed and instead extrachromosomal circular elements 
with the NEO and HYG markers were generated (14). 
Extrachromosomal circles are often formed by homolo- 
gous recombination (HR) between direct repeated 



sequences (7,10,11) and this prompted us to search for 
such repeated sequences in the vicinity of the GSH1 
gene. We discovered direct repeats of 466 bp with 98.9% 
identity flanking a region of 10.136kb and encompassing a 
region with four genes (Figure 3A). Interestingly, the last 
100 bp of the 3' end of the GSH1 inactivation cassette 
construct were part of the 466-bp repeated sequences. 
The extrachromosomal circular elements derived from 
these cells were isolated (Figure 3B, extreme left), 
digested with Ncol and hybridized to the HYG and 
NEO probes, and after longer migration to the 5' flank 
GSH1 probe. Both the HYG and NEO alleles were 
amplified as circles giving rise to the expected restriction 
fragments after Ncol digestion (Figure 3B, middle panels). 
To ascertain that the extrachromosomal circular amplicon 
was generated by HR between the 466-bp direct repeated 
sequences, a pair of primers (la and lb) was designed to 
amplify the new junction of the circle generated by recom- 
bination from the DNA isolated by alkaline lysis (Figure 
3A and C). PCR amplification with this set of primers 
indeed amplified the expected 850-bp product from the 
DNA isolated from GSH1 recombinant cells subjected 
to high concentration of the drugs and not from WT 
under the conditions tested (Figure 3D). Sequencing of 
this amplified fragment confirmed that the circles were 
formed by HR between the 466-bp repeats (data not 
shown). 

We next investigated whether the direct 466-bp repeated 
sequences also played a role in locus duplication following 
targeting of GSH1. The copy number of neighboring 
genes of GSH1 on chromosome 18 was investigated by 
digesting genomic DNA from the WT and recombinants 
with Ncol and by Southern blot hybridization using 
nearby genes as probes (Figure 4A). Restriction fragments 
hybridizing to the probes used are shown in Figure 4A, the 
superscripts indicating the panel number in Figure 4B 
and C. We used probes recognizing genes upstream, 
within and downstream of the 466-bp repeated sequences. 
LinJ23_V3.0310 (PTR1) was used as a control for normal- 
ization (Figure 4B; panel, control). The copy number of 
genes Lin J 18 _V 3. 1600 to Lin J 18 _V 3. 1620 (upstream the 
direct repeated sequence) remained diploid (Figure 4B, 
panels a-c) and similarly the copy number of the genes 
LinJ18_V3.1670 and Lin J 18 _V 3. 1680, downstream the 
repeated sequences were unchanged (Figure 4B, panels d 
and e). However, when the blots were probed with 
LinJ18_V3.1630 and Lin J 18 _V 3. 1640, we observed an 
increase in copy number of the gene by one copy in the 
mutants every time GSH1 was targeted in the parental 
strain. For example in GSH1/GSH1/HYG (Figure 4C, 
panels i and ii lanes 2) and GSH1/GSH1/NEO 
(Figure 4C, panels i and ii lanes 3), densitometric 
analyses revealed that the intensity of the bands were 
approximately higher by 1.5 times than WT. 
Interestingly, the band intensities of Lin J 18 _V 3. 1630 and 
LinJ18_V3.1640 were ~2 and 2.5 times higher than WT in 
GSH1/GSH1/NEO/HYG and GSH1/GSH1/NEO/HYG/ 
BLA mutants respectively (Figure 4C, panels i and ii 
lanes 4 and 5). These blots (Figure 4C, panels i and ii) 
were normalized with LinJ18_V3.1620 (Figure 4C, panel 
control), a gene on chromosome 18 which did not change 
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Figure 2. GSH1 polyploidy is not due to gross translocation, circular/linear amplification or chromosomal aneuploidy. Pulsed-field gel 
electrophoresis was used for separating chromosomes between 100kb and 900 kb and the gels were visualized by ethidium bromide staining and 
transferred and hybridized to a GSH1 ORF (A), HYG (B), NEO (C), BLA (D) probes. Molecular weight markers (0.225-2.2 Mb S. cerevisiae 
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its copy number when GSH1 was targeted. Probing with 
LinJ18_V3.1650 yielded multiple bands because of the 
nearby integration of the resistance markers NEO, HYG 
and BLA (Figure 4C, panel iii). Densitometric analyses 
revealed that all mutants retained two WT alleles in 
addition to the integration of a copy of the markers 
(Figure 4C, panel iii and data not shown). When the blot 
was probed with LinJ18_V3.1660 (GSH1) we observed an 
identical copy number of the gene in WT and all recombin- 
ants (Figure 4C, panel iv). Digesting genomic DNA of 
these recombinants with Clal and probing with the same 
genes between or outside the repeats essentially yielded 
similar results (data not shown). The results suggested 
that the region between the 466-bp repeated sequences 
was duplicated after each attempt of inactivating GSH1. 



Chromosomes of WT parasites and recombinants were 
separated by PFGE and Southern blot analysis was per- 
formed with probes covering LinJ18_V3.1620 to 
LinJ18_V3.1670 (Figure 5A) and probes specific for each 
resistance marker (Figure 5B). The targeting of either 
HYG or NEO marker in WT indicated that it integrated 
into chromosome 18 (Figure 5B, lanes 2 and 3) and that 
chromosome 1 8 remained intact but with a slight increase 
in intensity as determined by hybridization with four 
probes within the repeats (Figure 5 A, lanes 2 and 3). 
Targeting of the HYG marker in the GSH1 locus in the 
GSH1/GSH1/NEO parasites led also to the integration 
of the HYG cassette in chromosome 18 but we observed 
a further increase in the intensity in the hybridiza- 
tion signal with the four probes LinJ18_V3.1630 to 
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Figure 3. Formation of extrachromosomal circles by homologous recombination between direct repeated sequences. (A) Schematic drawing of the 
GSH1 locus in L. infantum, before and after integration of the HYG and NEO cassettes with the direct repeated sequences of 466 bp as small black 
boxes. Genes within the repeated sequences are represented as 1, 2, 3 and 4 where 4 is LinJ18_V3.1660 (GSH1). The gene upstream the repeat is 
represented as —1 (LinJ18_V3.l620) and the gene downstream the repeat is represented as 5 (LinJ18_V3.l670). The sizes of the Ncol digests are 
shown as dotted lines. H, hygromycin phosphotransferase B; N, neomycin phosphotransferase. (B) Isolation of circular DNA amplicons using the 
Promega Wizard Plus kit. The white arrow represents the presence of extrachromosomal circular DNA (extreme left, lanes 3 and 4). The plasmid was 
digested with Ncol (second gel from the left), followed by Southern blot hybridization with HYG and NEO probes. The Ncol digested DNA was 
also run longer and hybridized with the 5' flank GSH1 (right panel) (C) Model for the generation of the extrachromosomal circular amplicon, formed 
by HR between the 466-bp direct repeated sequences. (D) PCR amplification with primers la and lb as shown in (A) and (C) were used to amplify 
the 850-bp fragment from circular DNA isolated by alkaline lysis. 1, WT (GSHJ/GSHl); 2, GSH1/GSH1/NEO/HYG; 3, GSH1/GSH1/NEO/HYG 
grown in the presence of 10 X EC 50 of the selective drugs; 4, WT transfected with an episomal vector (PspaZeoa-GSHl) as positive control for 
plasmid isolation. 
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3' LinJ 18 _V 3. 1660 (550 bp) that are within the direct 
repeated sequences but not with the genes 
Lin J 18 _V3. 1620 and LinJ 18 _V3. 1670 just outside the 
repeats (Figures 4A and 5A, lanes 4). However, with all 
these probes we saw a small size upper shift in the 
hybridizing band due to an increase in size of the chromo- 
some. This shift was also observed with the HYG and 
NEO hybridization (Figure 5B, lanes 4). This was vali- 
dated further with the integration of the BLA marker in 
the GSH1 1 GSH1 /NEO /H YG parasites. In this case, two 



homologs of chromosome 18 were clearly observed as 
discrete bands, the band with the higher molecular 
weight hybridizing with the HYG, NEO and BLA probes 
(Figure 5B, lanes 5) and the four genes within the repeats 
clearly giving rise to a stronger hybridization signal on the 
upper band compared to the two genes outside the repeats 
{LinJ 18 _V3. 1620 and LinJ '18 _V3. 1670) (Figure 5A, 
lanes 5). The data are compatible with intrachromosomal 
duplication of the GSH1 locus between the 466-bp repeats, 
after each inactivation attempt (Figure 6). To prove this, 
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Figure 6. Tandem duplication of the GSHl locus by homologous recombination between direct repeats. (A) Schematic diagram of the GSHl locus in 
WT L. infantum. Only one of the chromosomal homologs of the disomic chromosome is shown. Numbers in boxes correspond to the genes in 
vicinity of GSH1. The estimated size of restriction fragments following Ncol, Ndel (Nd) and Hpal (Hp) digestions are shown with dotted lines along 
with the length of the restriction fragments. Superscripts denote the panels in Figure 4 where those restriction fragments are demonstrated by 
Southern blots. (B-E) The rearrangement at the genomic locus of the mutants when GSH1 was targeted, either with HYG to form GSHI/GSHl/ 
HYG (B) or NEO to generate GSH1/GSH1/NEO (C). The GSH1/GSH1/NEO targeted with HYG to form GSH1 j GSH1 j NEO/H YG (D); and the 
GSHl j GSH1 /NEO/H YG targeted with BLA to generate GSH1 j GSHl/NEO/H YG/BLA (E). H, hygromycin phosphotransferase B; N, neomycin 
phosphotransferase; B, blasticidin deaminase. 



we first digested intact chromosomes in agarose blocks 
with Hpal, an enzyme which does not cut within the 
GSHl locus (Figure 6). These digests were run on 
CHEF and hybridized to genes located within the 
repeated sequences. Consistent with the scenario shown 
in Figure 6, all markers integrated into the same allele 
and increased the size of one chromosomal homolog by 
about 10 kb (the size of the GSHl locus between the 
466-bp repeats) after each round of targeting, keeping 
the other homolog intact (Figure 5C, lanes 2-5). Probing 
with the GSHl open reading frame (ORF) revealed an 
expected equal intensity in both chromosomal homologs 
in all mutants (Figure 5D). Hybridization with HYG, 
NEO and BLA confirmed that all the markers integrated 
into the same allele (Figure 5D). 

The four-gene locus between the 466-bp repeats has 
thus duplicated in tandem each time GSHl was targeted. 
In order to show that duplication is ordered, we digested 
the DNA of all the recombinants with Ndel, which cuts 
within the HYG and GSHl but neither within NEO and 
BLA nor elsewhere within the locus (Figure 6). Using a 
probe covering the first 1000 bp of GSHl (before the Ndel 
site), a Ndel digest should lead to a fragment of 12.7 kb in 
WT cells (Figure 6A) and, indeed, this was observed 
(Figure 7, lane 1). The same band was also observed in 
all recombinants, consistent with at least one intact GSHl 
allelic locus, (Figure 7A, lanes 2-5). Integration of 
HYG or NEO in the locus hybridized to the same GSHl 
probe should lead to 9-kb and 21-kb Ndel fragments 
(Figure 6B and C) only if the WT locus duplicated down- 
stream of integration and this was indeed observed experi- 
mentally (Figure 7A, lanes 2 and 3). If the order of 
duplication mimics the order of inactivation attempts we 
should obtain the intermediate structure shown in 
Figure 6D and the final product shown in Figure 6E. 



Digestion with Ndel and hybridization with the same 
GSHl probe would lead to hybridizing fragments of 
9kb and 17 kb in the GSHI/GSHl /NEO /HYG and 
GSHI/GSHl /NEO /HYG /BLA recombinants, respectively 
(Figure 6D and E) and this is what we have observed 
(Figure 7 A, lanes 4 and 5). Hybridization of these 
Ndel-digested DNA with a probe derived from the first 
434 bp of HYG (before the Ndel site) would yield a 12-kb 
fragment in GSH1/GSH1/HYG cells but a 21-kb fragment 
in the GSHI/GSHl /NEO /HYG and GSH1/GSH1/NEO/ 
HYG/BLA cells (Figure 6B, D and E) and this was 
indeed observed (Figure 7B). Hybridization data of the 
same digests with NEO and BLA probes (Figure 7C 
and D) were also totally consistent with the maps shown 
in Figure 6. This suggests that upon each round of target- 
ing, the WT allele is the one targeted and duplicated; so, the 
most recently targeted marker will be just upstream of the 
WT allele while additional markers will be more internal, 
in the order in which they were sequentially targeted. 

The data are thus consistent with the duplication of 
the four gene locus containing GSHl that are within 
the 466-bp direct repeated sequences. This duplication 
occurred a first time upon the integration of either the 
HYG (Figure 6B) or NEO marker (Figure 6C), a second 
time upon the integration of the HYG marker into a 
GSH1/GSH1/NEO line (Figure 6D) and a third time 
upon integration of the BLA marker when GSHl was 
targeted into a GSHl /GSH1/NEO /H YG line (Figure 6E). 
All these integrations took place on the same chromo- 
somal homolog explaining the increase in both the size 
and the hybridization intensity of chromosome 18 as 
seen in CHEF analysis upon each round of integration 
(Figure 5C and D). This process always maintains two 
copies of GSHl intact, despite repetitive targeting of 
GSHl and ordered integration of various markers used. 
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Figure 7. Duplication of the GSH1 locus as determined by Ndel digests and Southern blot analyses. Leishmania genomic DNA was digested with 
Ndel and probed with the initial 1000 bp of GSH1 ORF and 434 bp of HYG (up to their Ndel site) or complete ORFs of NEO and BLA. Molecular 
weight markers from High Range DNA ladder are indicated on the right. 1, WT L. infantum. GSHljGSHl; 2, GSH1/GSH1/HYG; 3, GSH1/GSH1/ 
NEO; 4, GSHljGSHl /NEO /HYG; 5, GSH1/GSH1/NEO/HYG/BLA. 
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Figure 8. Duplication of GSH1 locus at the level of direct repeats. (A) A pair of primers, la and lb can amplify a region of 850 bp when the GSH1 
locus has duplicated in tandem. To detect the novel junctions and the sequential duplication of the locus, forward primers were also designed before 
the end of the marker genes (H, N and B) as indicated by 2^1a and reverse primers in the middle of LinJ18_V3.1630 (box 1), indicated as 2-4b. 
Other primers 5a/5b to 8a/8b were used for long-range PCR. The genomic locus of the WT and triple mutant (GSH1/GSH1/NEO/HYG/BLA) are 
shown in this figure. (B) PCR amplification with the primers la and b using genomic DNA from WT and recombinants. PTR1 was used for 
normalization of template DNA. (C) Detection of novel junctions due to the rearranged genomic locus in the recombinants using primer pairs 2a/2b, 
3a/3b and 4a/4b. 1, WT L. infantum, GSH1/GSH1; 2, GSH1/GSH1/HYG; 3, GSH1 /GSH1/NEO; 4, GSHljGSHl /NEO /HYG; 5, GSH1/GSH1/NEO/ 
HYG/BLA. (D) Genomic DNA of the GSHljGSHl /NEO/HYG/BLA was amplified by long-range PCR (DyNAzyme™ EXT DNA Ploymerase, 



Finnzymes) to further validate the order of marker integration on chromosome IS 
primers 5a/6b (2); 5a/7b (3); 5a/8b (4); 6a/7b (5); 6a/8b (6) and 7a/8b (7). 



Lane 1, High Range DNA ladder; long-range PCR products with 



The data led us to suggest that the duplication occurs at 
the level of the direct repeated sequences. 

This was further supported by PCR amplification 
of the novel junctions created. The pair of primers la 
and lb (described in Figure 3) amplified a DNA 
fragment of 850 bp in the WT cells (Figure 8B, lane 1) 
but with more intensity in the recombinants (Figure 8B, 
lanes 2-5) with the same amount of template genomic 
DNA as normalized with amplification of PTR1 



(LinJ23_V3.0310). The presence of this novel junction in 
WT cells using a highly sensitive PCR assay suggests that 
recombination between these repeated sequences is a rela- 
tively frequent event (see 'Discussion' section). 

To confirm the rearrangement by tandem duplication in 
these GSH1 recombinants, we designed primers to amplify 
the novel junctions generated by duplication but by 
having forward primers before the end of marker genes 
(Figure 8A) and reverse primers were designed in the 
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middle of LinJ18_V3. 1630 (Figure 8A). These three sets of 
primers in the mutants yielded the expected product at the 
appropriate size by PCR (Figure 8C) and no band was 
detected using WT genomic DNA (Figure 8C). The 
PCR products were sequenced and the analysis was con- 
sistent with the rearrangements occurring at the level of 
the repeats (data not shown) and confirmed the process of 
tandem duplication on chromosome 18. The ordered inte- 
gration of all markers and duplication of WT locus on the 
same chromosomal homolog were verified by Southern 
blot analyses (Figures 5C and D and Figure 7). This was 
further supported by long-range PCR of the GSHlj 
GSH1/NEO/HYG/BLA recombinants (Figure 8D). 
Indeed, the NEO gene was 9.4 kb apart from HYG 
(Figure 8D, lane 2) and 19 kb from BLA (Figure 8D, 
lane 3) using long-range PCR. We failed, however, 
under our experimental conditions, to amplify the 27-kb 
fragment separating NEO and GSH1 (Figure 8D, lane 4). 
The use of long-range PCR confirmed that the HYG gene 
was 9kb apart from BLA (Figure 8D, lane 5) while the 
HYG and BLA genes were respectively 19kb and 9kb 
apart from the GSH1 gene (Figure 8D, lanes 6 and 7). 
Other primer combinations did not give rise to PCR 
products. These results confirmed the structure of 
chromosome 18 shown in Figures 6E and 8A. 



DISCUSSION 

We recently demonstrated that GSH1 is essential where 
the markers (HYG, NEO and BLA) integrated at the hom- 
ologous locus but the recombinants invariably retained 
two WT copies of GSH1. However, when an episomal 
copy of GSH1 was provided it was possible to generate 
a GSH1 chromosomal null mutant (14). 

Previous attempts to inactivate essential genes in 
Leishmania resulted in gene rearrangements such as super- 
numerary chromosomes (2) or genome wide polyploidy 
(13) or linear amplicons (19). For example, attempts to 
inactivate the dihydrofolate reductase-thymidylate 
synthetase (DHFR-TS) in Leishmania resulted in aneu- 
ploid trisomic lines, genomic tetraploids and diploids 
bearing homologous integration of the targeting 
fragment without replacement (2). In an attempt to inacti- 
vate trypanothione reductase (77?), both the WT alleles 
were disrupted but a third copy of TR was generated by 
genomic rearrangement, involving the translocation of 
TR containing region to a larger chromosome (12). 
Inactivation of the cdc2-related kinase (crkl) led the trans- 
fected fragment to form an episome, or the cloned trans- 
fectants were genomic triploids or tetraploids (13). In this 
study, we show a complete different mechanism of 
regional polyploidy emerging from tandem duplication 
of a segment bordered by direct repeats (Figure 6). 

Homologous recombination between direct repeated 
sequences is well known for its implication in the forma- 
tion of extrachromosomal circular elements (3,4,6,7,11). 
In our loss of heterozygosity attempts of GSH1, we 
observed circular amplification of the selectable markers 
(HYG and NEO) in GSH1/GSH1 /NEO /HYG cells in 
whose media the selective drugs were raised 10-fold 



(Figure 3C). These segments were amplified as extrachro- 
mosomal circles through HR between the 466-bp dir- 
ect repeats bordering Lin J 18 _V 3. 1630 and GSH1 
(LinJ18_V3.1660) (Figure 3C). The same 466-bp direct 
repeated sequences, present on chromosome 18, were 
used for tandem duplication of a four-gene region upon 
attempts to inactivate the essential GSH1 gene. This is a 
novel strategy for Leishmania to generate polyploidy. It is 
intriguing that in WT cells (Figure 8B, lane 1) we could 
detect (using a sensitive PCR assay) a new junction, sug- 
gesting that this rearrangement occurs continuously in a 
limited number of cells and is selected when we attempted 
to disrupt GSH1. It is salient to point out, however, that it 
is not possible to discriminate whether this recombination 
leads to duplication (as seen for GSH1) or as extrachro- 
mosomal circles since such events could give rise to 
the same amplification product (compare Figure 3C 
and Figure 8A). Ongoing work in the lab suggests 
that this background amplification in Leishmania at the 
level of repeated sequences is common throughout the 
Leishmania genome (manuscript in preparation). In 
eubacteria, gene amplification by gene duplication is 
frequent. In a growing population of bacteria the fre- 
quency of spontaneous tandem duplication ranges from 
10~ 2 to 10~ 4 , depending on the gene and the genomic 
region (20). These duplications provide a large reservoir 
of standing genetic variation from which selective pressure 
can favor cells with an increased copy number of any 
specific region. Once a tandem genetic duplication has 
been formed, selection for increased gene dosage may 
drive further amplification of the duplicated sequence, 
increasing the length of the genome by 10-80% in 
Salmonella and Escherichia coli (21-24). The mechanism 
of amplification from an initial duplication involves 
RecA-mediated nonequal crossing over between sequences 
on sister chromatids (20,25,26) or a rolling circle type of 
mechanism resulting in the formation of large tandem 
arrays (27). Intrachromosomal, clustered and tandem 
amplicons by palindromic duplication are also one of 
the somatic rearrangements in cancer cells, suggesting to 
occur by a breakage-fusion-bridge (BFB) mechanism 
(28,29). 

Successful integration of the targeting fragment at 
GSH1 was always accompanied by expansion of the 
targeted locus to include both the targeting DNA and 
the WT copy of GSH1, and, in every instance, the WT 
GSH1 gene was at the telomere proximal side of the 
targeted locus (Figure 6). Interestingly, this duplication 
was not observed when a WT copy of GSH1 was 
supplied on a plasmid. Gene targeting may thus occur in 
cells that had previously duplicated the GSH1 locus by 
unequal sister chromatid exchange mediated at the level 
of the direct repeats flanking the locus, and only those 
cells with pre-existing duplications survive selection for 
the marker on the targeting DNA. Indeed, unequal 
sister chromatid exchange has often been invoked for 
explaining naturally occurring tandem duplications in 
several organisms (30,31). If this were the case, however, 
one would expect that the duplicated alleles would be 
targeted randomly, but, instead, we observed a specific 
order of integration suggesting that the process of gene 
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Figure 9. Tandem duplication of a subtelomeric essential gene in Leishmania by break induced replication. (I) Inactivation of GSH1 by gene 
replacement with a targeting cassette by homologous recombination in the 5' and 3' flank involving a double cross over would result in the 
integration of the marker (M) and disruption of GSH1. (II) However, if the integration of the inactivation cassette into the homologous position 
happens by a single crossover to survive the selection pressure, it creates a double-strand break (DSB) on the chromosome (III). (IV) At the DSB, a 
5'-^ 3' degradation of DNA takes place, exposing single-strand DNA. Single-strand DNA could initiate pairing and strand invasion of homologous 
duplex DNA from the sister chromatid or the other homologous allele (red). The 3' flank of GSH1 containing the first 100 bp of the repeated 
sequences (black box) could misalign and pair with the repeated sequence of the homologous duplex DNA (yellow box) and synthesis begins. 
(V) DNA synthesis is extended and replication continues to the end of the chromosome yielding a chromosomal copy with a duplicated locus in 
tandem and the other one intact. Double arrowhead represents the telomere. 



targeting might lead to the duplication rather than simply 
taking advantage of a pre-existing duplication. 

The fact that GSH1 is flanked by direct repeated se- 
quences and is subtelomeric lead us to propose a model 
for duplication of the GSH1 locus involving partial inte- 
gration of the targeting DNA followed by break-induced 
replication (BIR), as shown in Figure 9. In the instance 
where targeted gene replacement occurs, loss of the essen- 
tial GSH1 gene leads to cell death, so these events are not 
observed (Figure 91). Instead, we propose that one end of 
the targeting DNA integrates into the chromosome before 
the other (Figure 911) and this single crossover creates 
a double-strand break (DSB) in the chromosome 
(Figure 9III). The concept of single crossover at one end 
of the targeting fragment has been previously shown in 
yeast (32). Double-strand breaks on chromosomes are 
lethal and must be repaired. Several sub-pathways 
involving HR have been defined for repair and these 
include double-strand break repair (DSBR), 



synthesis-dependent strand annealing (SDSA) and BIR 
(33). Repair of this DSB during gene targeting by either 
SDSA or gene conversion would lead to removal of either 
the targeting DNA or the GSH1 gene, so these events 
would not survive selection. Instead, we propose that the 
break is repaired by BIR, a mechanism well studied in 
Saccharomyces cerevisiae (34). During BIR, the 
centromere-proximal end of a break is processed into a 
3' overhang, which can invade the sister chromatid or 
the other copy of the chromosome using homologous se- 
quences and use it as a template for repair. DNA 
polymerase-mediated synthesis completes the repair (35). 
In Leishmania, duplication of the GSH1 locus occurs when 
the 466-bp direct repeated sequence on the left side of the 
break invades an identical 466-bp repeat upstream of the 
GSH1 locus on the sister chromatid/homologous chromo- 
some (Figure 9IV) and copies to the end of the chromo- 
some, resulting in a characteristic tandem duplication of 
the locus containing the wild type GSH1 gene on the 



7510 Nucleic Acids Research, 2011, Vol. 39, No. 1 7 



telomere proximal side, and keeping the other allele intact 
(Figure 9V). If the targeting construct did not contain any 
part of the 466-bp repeated sequences, we speculate that 
tandem duplication could still have occurred by BIR with 
two or more rounds of invasion (Supplementary 
Figure S2), a phenomenon already described in yeast 
(36). The presence of the repeats on the targeting DNA 
probably increases the likelihood of misaligned BIR dra- 
matically because it can occur on the first round of strand 
invasion and does not involve secondary steps, namely 
invasion, synthesis, strand displacement and re-invasion. 
Other mechanisms could also explain the gene duplication 
observed, notably unequal sister chromatid exchange, but 
this model parsimoniously explains the duplication and 
the specific orientation of the two copies of the duplicated 
locus, and is supported by the presence of the direct 
repeats flanking GSH1 and by the subtelomeric location 
of the GSH1 gene. The subtelomeric regions of eukaryotes 
are hotspots of genetic rearrangements and most of these 
events are best explained by the introduction of a DSB 
that is repaired by BIR using repetitive sequence 
elements located in the same or different chromosomes 
(36,37). Recent work has shown that a programmed 
double strand break in Trypanosoma brucei is an effective, 
natural trigger of variant surface glycoprotein (VSG) 
switching and the repair of DSB is probably achieved by 
BIR and a repetitive sequence adjacent to a VSG gene in 
the telomeric loci facilitate BIR through homology recog- 
nition (37). Also, compelling evidence was given to 
support the idea of duplicative gene conversion into an 
active VSG expression site for activation of a new VSG 
in T. brucei (38). BIR is also used for repair of shortened 
telomeres in the absence of telomerase in numerous in- 
stances (39). While these processes are mechanistically 
distinct from the duplication observed during targeting 
of the GSH1 locus, it was the prevalence of 
repeat-mediated, break-dependent subtelomeric DNA 
rearrangements in a wide variety of organisms, including 
T. brucei, that first suggested a possible model by which 
gene targeting might drive the duplications described here. 

Leishmania is a diploid organism with considerable 
genome plasticity. It was soon realized that targeting es- 
sential genes can lead to gene rearrangements (2,12,13,19). 
In this study, we provide evidence for a new type of re- 
arrangement leading to the duplication of a region at the 
level of repeated sequences. Sequence analysis of the 
Leishmania genome has revealed several gene duplication 
and it is believed that smaller gene families (<10 members) 
appears to have arisen from tandem duplication (40). 
Tandem duplication of a gene cluster (two to three 
genes) is also found but with a lesser frequency in some 
chromosomes and it could be possible that some of these 
events took place by intrachromosomal HR between 
repeated sequences resulting in repeat expansion. A pre- 
liminary analysis partially supports this hypothesis. We 
found six examples where a gene cluster (two or three 
genes) is duplicated in tandem intrachromosomally. Two 
of these clusters were likely duplicated by HR through 
these repeats. For the other clusters either we had no 
evidence for this event or they have undergone complex 
rearrangements which could have led to the loss of 



repeated sequences. A group of eight genes involved in 
lipophosphoglycan biosynthesis or modification (the 
SCGR and SCA genes) are clustered on chromosome 2 

(41) . The arrangement would suggest a duplication of 
four genes and indeed repeated sequences of 412 bp 
(98% identity) are strategically positioned to duplicate 
the locus (Supplementary Figure S3A). The other 
example we found is that of the tandem duplication 
of a hypothetical gene and the gene coding for 
S-adenosylmethionine synthetase on chromosome 30 

(42) . On inspecting the genomic locus we discovered 
direct repeated sequences of 212 bp (98% identity) at pos- 
itions consistent with gene duplication through HR at the 
level of 212-bp repeats (Supplementary Figure S3B). 
Expansion of gene families by tandem duplication by 
HR at the level of direct repeats is a potential mechanism 
by which parasites can increase gene copy number and this 
was shown experimentally here during gene inactivation 
attempts of GSH1. Further work should allow testing for 
additional roles for repeated sequences in gene duplication 
in Leishmania and whether unequal sister chromatid 
exchange and BIR (Figure 9) can explain such duplicative 
events. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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